Operational Characterization of Simultaneous Measurements in 

Quantum Mechanics 

Masanao Ozawa 

School of Informatics and Sciences, Nagoya University, Chikusa-ku, Nagoya 464-01, Japan 

(February 1, 2008) 



Abstract 



Quantum mechanics predicts the joint probability distribution of the out- 
comes of simultaneous measurements of commuting observables, but, in the 
state of the art, has lacked the operational definition of simultaneous measure- 
ments. The question is answered as to when the consecutive applications of 
measuring apparatuses give a simultaneous measurement of their observables. 
For this purpose, all the possible state reductions caused by measurements of 
an observable is also characterized by their operations. 
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In quantum mechanics, observables are represented by linear operators, for which the 
product operation is not commutative. If two observables are represented by commuting 
operators, they are simultaneously measurable and quantum mechanics predicts the joint 
probability distribution of the outcomes of their simultaneous measurement. But, it has not 
been answered fully what measurement can be considered as a simultaneous measurement 
of those observables. 

Let S be a quantum system with the Hilbert space TC of state vectors. Let A be an 
observable of S. For any Borel set A in the real line R, the spectral projection of A 
corresponding to A is denoted by E (A); if A has the Dirac type spectral representation 

^ = EE/^XM+E / A|A,i/)<MdA, (1) 

where \x varies over the discrete eigenvalues, A varies over the continuous eigenvalues, and v 
is the degeneracy parameter, then we have 

E A (A)=J2J2 lA*. H + E / |A,^)(A,H^A. (2) 

v ,u€A v JA 

For any real number t, we shall denote by p(t) the state (density operator) of S at the time t. 
We shall denote by u A(t) G A" the probabilistic event that the outcome of the measurement 
(under consideration) of an observable A at the time t is in a Borel set A. According to the 
Born statistical formula, we have 

Pi{A(t) e A} = Tr[E A (A)p(t)]. (3) 

Any commuting observables A and B are simultaneously measurable and the joint prob- 
ability distribution of the outcomes of their simultaneous measurement is postulated by 

Pr{A{t) e A, B(t) e A'} = Tt[E a (A)E b (A')p(t)]. (4) 

A well-known proof of this quantum rule runs as follows Since A and B commute, 
there exist an observable C and Borel functions / and g such that A = /(C) and B = g(C). 
For the outcome c of the C-measurement, define the outcome of the A- measurement to be 
/(c) and the outcome of the £>-measurement to be g(c). Then, it follows from (|3|) that 
their outcomes satisfy (|^) so that the measurement of C at the time t gives a simultaneous 
measurement of A and B. 

The above proof gives one special instance of simultaneous measurement which uses only 
one measuring apparatus, but it is rather open when a pair of measuring apparatuses for 
A and B make a simultaneous measurement of A and B. The purpose of this paper is to 
answer this question. 

The conventional approach to the problem assumes the projection postulate formulated 
by Luders [0 as follows: If an observable A is measured in a state p, then at the time just 
after measurement the object is left in the state 

E A ({a})pE A ({a}) 
Tr[E A ({a})p] ' 
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provided that the object leads to the outcome a. 

Suppose that at time t an observable A is measured by an apparatus satisfying the 
projection postulate and that at the time, t + At, just after the A-measurement an arbitrary 
observable B is measured. The joint probability distribution of the outcomes of the A- 
measurement and the 5-measurement is given by 

Pr{A(t) G A,B(t + At) G A'} 

= £ Tr[E B (A)E A ({a})p(t)E A ({a})\. (5) 

aeA 

Thus, if A and B commute, we have 

Pr{A(t) G A, B(t + At) G A'} = Tr[E A (A)E B (A')p(t)}. (6) 

We can therefore reinterpret the i?-measurement at the time t+At as the S-measurement at 
the time t and consider the above consecutive measurements of A and B as the simultaneous 
measurement of A and B at the time t. 

If we would restrict measurements to those satisfying the projection postulate, any con- 
secutive measurements of commuting observables could be considered as simultaneous mea- 
surement, but this approach has the following limitations: 

(i) Some of the most familiar measuring apparatuses such as photon counters do not 
satisfy the projection postulate ||. 

(ii) When the observable has continuous spectrum, no measurements satisfy the projec- 
tion postulate ||]. 

The above limitations appear indeed quite serious. From (i), we cannot apply (|4]) to 
correlation measurements using photon counters as in most of optical experiments and EPR- 
correlation measurements |0-|§. From (ii), we cannot use @ for any measurements of 
continuous observables using more than one apparatuses. 

Now we shall abandon the projection postulate and consider the following problem: under 
what condition can consecutive measurements of two or more observables be considered as a 
simultaneous measurement of those observables? 

Suppose that an observable A of S is measured at time t by a measuring apparatus A 
with the state space H,a- Let t + At be the time just after measurement. The measurement 
is carried out by the interaction between S and A from t to t + At, and after the time 
t + At the object S is free from the apparatus A. Suppose that the apparatus is in the state 
a at the time t and let U be the unitary operator representing the time evolution of the 
composite system S + A from t to t + At. Then, the object S is in the state 

p(t + At) = Tr A [U(p(t) <g> a) tf\ (7) 

at the time t + At, where Tr A is the partial trace over 7^ A - The state change p(t) i— > p(t+ At) 
is determined independent of the outcome of measurement and called the nonselective state 
change. 

Let B be an arbitrary observable of S. We say that the measurement of A using the 
apparatus A does not disturb the observable B iff the nonselective state change does not 
perturb the probability distribution of B, that is, we have 

Ti[E B (A)e~ lHAt / n p(t)e iHAt/h } = Ti[E B (A)p(t + At)] (8) 
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for any Borel set A, where H is the Hamiltonian of the system S. The measurement is said 
to be instantaneous iff the duration At of measuring interaction is negligible in the time 
scale of the time evolution of the object. Thus, the instantaneous measurement of A using 
A does not disturb B if and only if 

Tr[E B (A)p(t)} = Tr[E B (A)p(t + At)] (9) 

for any Borel set A. 

By (0) and by the property of the partial trace, we have 

Tr[E B (A)p(t + At)] 

= Tr [E B (A)Tr A [U(p(t) <g> a)U^\ 

= Tr [TrA^^A) ® I)U{I ® a)\p{t)] . (10) 
Since p{t) is arbitrary, @ is equivalent to 

E B (A) = Tr A [[/t(£ B (A) ® I)U(I ® a)] (11) 

for any Borel set A. 

Now we can state precisely the answer to the above problem to be obtained in the present 
paper. 

Theorem 1. The instantaneous measurement of an observable A at time t using an 
apparatus A does not disturb an observable B if and only if we have 

Pr{A(t) G A,B(t + At) G A'} 

= Ti[E A (A)E B (A')p(t)} (12) 

for any density operator p(t) and any Borel sets A and A'. In this case, A and B necessarily 
commute. 

In what follows, we shall characterize all the possible state reductions caused by mea- 
surements of an observables in order to provide the proof of the above theorem, 

For any Borel set A, let pit + At\A{t) G A) be the state at t + At of the object S 
conditional upon A(t) G A. Thus, if the object S is sampled randomly from the subensemble 
of the similar systems that yield the outcome of the A-measurement in the Borel set A, then 
S is in the state p(t + At\A(t) G A) at the time t + At. When A = R, the condition A(t) G A 
makes no selection, and hence we have 

p(t + At\A(t) G R) = p{t + At). (13) 

For A^R, the state change p(t) i— > pit + At\A{t) G A) is called the selective state change. 
When Vi{A{t) G A} = 0, the state p(t+At\A(t) G A) is indefinite, and let p{t+At\A{t) G A) 
be an arbitrarily chosen density operator for mathematical convenience. 

Davies and Lewis postulated that for any Borel set A there exists a positive linear 
transformation Ta on the space tc(H) of trace class operators on 7i satisfying the following 
conditions: 

(i) For any Borel set A and disjoint Borel sets A n such that A = \J n A n and for any 
p G tc(H), 
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7a(p) = £?a>). 

n 



(14a) 



(ii) For any Borel set A and any p G rc(7i), 

Tr[T A (p)) = Tr[E A (A)p}. (14b) 

(iii) For any Borel set A with Pr{,4(f) G A} > 0, 

p{t+AtWt)eA) = ^Mkr (14c) 

The validity of the Davies-Lewis postulate was previously demonstrated in [Q based on 
the joint probability formula, where it is also shown that any such transformations T A which 
are realizable by a measuring process are completely positive and vice versa. In what follows, 
we shall prove the Davies-Lewis postulate without assuming the joint probability formula so 
as to avoid the circular argument. 

Suppose that at the time t + At the observer were to measure an arbitrary observable 
B of the same object S. Then, the joint probability distribution of the outcomes of the 
^-measurement and the £>-measurement satisfies 

Yi{A{t) G A,B(t + At) G A'} 

= Tr[£ B (A')p(t + At\A{t) G A)]Tr [E A (A) p(t)]. (15) 

For any Borel set A, let T^[p(t)} be the trace class operator defined by 

T A [p(t)] = Tr[E A (A)p(t)]p(t + At\A(t) G A). (16) 

Then, flTED defines the transformation T A that maps p(t) to T A [p(t)}. It follows from ( |T5D 
and fll6|) that T A satisfies 

Pr{A{t) G A, B{t + At) G A'} = Tr[E B (A')T A [p(t)}}. (17) 

Suppose that the state pit) is a mixture of the states p\ and p2, i.e., 

p{t) = a Pl + (1 - a)p 2 (18) 

where < a < 1. This means that at the time t the measured object S is sampled randomly 
from an ensemble of similar systems described by the density operator p\ with probability 
a and from another ensemble described by the density operator p 2 with probability 1 — a. 
Thus we have naturally 

Pr{A(t) G A, Bit + At) G A'\p(t) = a Pl + (1 - a)p 2 } 
= a Pr{A(t) G A, Bit + At) G A'\p(t) = Pl } 

+ (1 - a) Pr{A(t) G A, B(t + At) G A'\p(t) = p 2 }, 

(19) 



where Pt{E\F} stands for the conditional probability of E given F. From (|17D and (|T^ 
we have 
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Tr 



E B (A')T A [api + (1 - a)p 2 ] 



Tr 



E B (A') [aT A ( Pl ) + (l-a)T A (p 2 )] 



Since B and A' are arbitrary, we have 



T A [api + (1 - OL)p 2 ] = aT A (pi) + (1 - a)T A (p 2 ) 



(20) 



(21) 



Thus, T A is an affine transformation from the space of density operators to the space of trace 
class operators, and hence it can be extended to a unique positive linear transformation of 
the trace class operators 
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We have proved that for any apparatus A measuring A there is uniquely a family 
{T A \ A G B(K)} of positive linear transformations of the trace class operators such that 
(|IT)|) and (|T^) hold, where £>(R) stands for the collection of all Borel sets. This family of 
linear transformations will be referred to as the operational distribution of the apparatus A. 

By the countable additivity of probability, if A = \J n A n for disjoint Borel sets A„, we 
have 



Pr{A(t) G A,B(t + At) G A'} 

= Pr{A(t) G A n , B(t + At) G A'}. 



(22) 



By ( |TTD and (^) we can prove ( |14a| ). Equations (|14b| ) and ( |14c| ) are obvious from (|i~6|). 
Thus, the operational distribution {T A \ A G B(R)} satisfies the Davies-Lewis postulate. 

Mathematical theory of operational distributions was introduced by Davies and Lewis || 
based on relation ( |14a|) as a mathematical axiom and developed in ||11|| . Theory of measuring 
processes based on completely positive operational distributions was developed extensively 
in i,[12HH- 

Now we are ready to state the following important relations for operational distributions. 

Theorem 2. Let {T A \ A G B(K)} be a family of positive linear transformations on 
rc{7i) satisfying ( \14<j ) and ( \14ty - Then, for any Borel set A and any trace class operator p 
we have 



T A (p) = T R (V(A)p) = T R (pE A (A)) 
= T R (E A (A)pE A (A)) . 



(23) 



A proof of the above theorem was given in H for the case where Ta is completely 



positive, and another proof was given in |L9| for the case where A is discrete. The general 



proof necessary for the above theorem is obtained by modifying the above proofs. 
By (0), (0), and (JT|) we have 

T R (p) = Ti[U(p®a)W] (24) 

for any p G rc{7i). From ( |14c|) , (|23|), and (p4[), we obtain the following characterization of 
the possible selective state changes: if~Pi{A{t) G A} > ; the state p{t + At\A{t) G A) is 
uniquely determined as 
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p(t + At\A(t) e A) 

Tr A [U(p(t)E A {A)®cr)W] 

Tr[E A {A)p{t)] 
Ti A [U(E A (A)p(t)®a)U'i} 

Tr[£^(A)p(t)] 
Tr A [U{E A (A) p{t)E A (A) g g)f/t] 
= Tr[^(A)p(t)] 



(25a) 
(25b) 
(25c) 



Now, we are ready to prove our main theorem. 

Proof of Theorem 1. It suffices to show the equivalence between (|ll]) and (|T2]). First, 
note that from (|15|) and (|25b| ) , the joint probability distribution of A and B is given by 

Pi{A{t) G A,B(t + At) G A'} 

= Tr [E B (A')Ti A [U(E A (A)p(t) ® a)U ] ] 

= Tr [£ A (A)Tr A [L/t(£ B (A') ® I)U(I ® o-)]/>(t)' . 

(26) 

If ( PJ) holds, ( [121) follows from (|^). Conversely, suppose that ( |12"1) holds. By substituting 
A = R in ([121), we have 



Pr{A(t) G R, B(t + At) G A'} = Tr[E B (A')p(t)]. (27) 
On the other hand, from (pBl) we have 



Pr{A(t) G R,5(t + At) G A'} 



Tr 



Tr A [C/t(^( A ) g, j^jj^j g, _ ( 28 ) 



Since p(t) is arbitrary, from (]27|) and (|28| ) we obtain ([TT|). Therefore, (|TT|) and (|T2]) are 
equivalent. To prove A and B commute, suppose that ( |T2"D holds. By the positivity of 
probability, the right hand side is positive. Since pit) is arbitrary, the product E A (A)E B (A') 
is a positive self-adjoint operator so that E (A) and E B (A') commute. Since A and A' are 
arbitrary, A and B commute. □ 

Obviously from flTTl), if U and B ® I commute, i.e., 



[U, E (A) <g) /] = (29) 

for any Borel set A, then the A-measurement does not disturb the observable B. However, 
(f29l) is not a necessary condition for nondisturbing measurement. In the case where a is 
a pure state a = it follows from ([□]) that the A-measurement does not disturb the 

observable B if and only if 

[U,E B (A)®I}\m) = (30) 

for any Borel set A and any state vector of S. 

Consider the case where the system S consists of two subsystems Si with state space Hi 
and S 2 with state space 7i 2 so that TC = Hi <E> Ti-2- Suppose that A belongs to Si and B 
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belongs to S2 so that A = C (g> I and B = I Cg> D for some C on 7ii and for some .D on H.2. In 
this case, we say that the ^-measurement using the apparatus A is local iff the measuring 
interaction couples only Si and A. Thus, the ^-measurement is instantaneous and local if 
and only if we have 

[u,i 1 ®x®r\ = o, (31) 

for any operator X on 7i 2 , where I\ is the identity on Tii and I is the identity on T~Ca- From 
and Q5T), any local instantaneous measurement of A does not disturb B and any local 



instantaneous measurement of B does not disturb A either. Therefore, it is concluded that 
any pair of local instantaneous measurements of A = I <g> C and B = I <g> D satisfies the 
joint probability formula 

Pi{A{t) e A, B{t) e A'} = Tr[(£ c (A) ® E D (A'))p(t)}, (32) 

regardless of the order of the measurement. 

In the EPR paper ||20|| , the so called EPR correlation is derived theoretically under 
the assumption that the pair of measurements satisfies the projection postulate, but the 
present result concludes that the EPR correlation holds for any pair of local instantaneous 
measurements as experiments have already suggested. 
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